Express Mail Label No. EM529199428 US




UTILITY PATENT APPLICATION TRANSMITTAL 
(Large Entity)

(Only for new nonprovisional applications under 37 CFR 1.53(b)) 



Docket No. 

2204/188 



Total Pages in this Submission 
26



TO THE ASSISTANT COMMISSIONER FOR PATENTS 
Box Patent Application 
Washington, D.C. 20231 

Transmitted herewith for filing under 35 U.S.C. 111(a) and 37 C.F.R. 1.53(b) is a new utility patent application for an
invention entitled: 




and invented by 



If a CONTINUATION APPLICATION, check appropriate box and supply the requisite information:
□ Continuation □ Divisional □ Continuation-in-part (CIP) of prior application No.
Which is a:

□ Continuation □ Divisional □ Continuation-in-part (CIP) of prior application No.
Which is a:

□ Continuation □ Divisional □ Continuation-in-part (CIP) of prior application No.
Enclosed are: 

Application Elements 

1. □ Filing fee as calculated and transmitted as described below

2. ☒ Specification having 16 pages and including the following:

a. ☒ Descriptive Title of the Invention

b. □ Cross References to Related Applications (if applicable)

c. □ Statement Regarding Federally-sponsored Research/Development (if applicable)

d. □ Reference to Microfiche Appendix (if applicable)

e. ☒ Background of the Invention

f. ☒ Brief Summary of the Invention

g. ☒ Brief Description of the Drawings (if drawings filed)

h. ☒ Detailed Description

i. ☒ Claim(s) as Classified Below

j. ☒ Abstract of the Disclosure



Page 1 of 3 



P01ULRG/REV04 











Application Elements (Continued)

3. Drawing(s) (when necessary as prescribed by 35 USC 113)

a. □ Formal Number of Sheets

b. ☒ Informal Number of Sheets 4




4. ☒ Oath or Declaration

a. □ Newly executed (original or copy) ☒ Unexecuted

b. □ Copy from a prior application (37 CFR 1.63(d)) (for continuation/divisional application only)

c. ☒ With Power of Attorney □ Without Power of Attorney

d. □ DELETION OF INVENTOR(S)
Signed statement attached deleting inventor(s) named in the prior application,
see 37 C.F.R. 1.63(d)(2) and 1.33(b).


5. □ Incorporation By Reference (usable if Box 4b is checked)
The entire disclosure of the prior application, from which a copy of the oath or declaration is supplied under
Box 4b, is considered as being part of the disclosure of the accompanying application and is hereby
incorporated by reference therein.

6. □ Computer Program in Microfiche (Appendix)

7. □ Nucleotide and/or Amino Acid Sequence Submission (if applicable, all must be included)

a. □ Paper Copy

b. □ Computer Readable Copy (identical to computer copy)

c. □ Statement Verifying Identical Paper and Computer Readable Copy

Accompanying Application Parts 




8. □ Assignment Papers (cover sheet & document(s))

9. □ 37 CFR 3.73(B) Statement (when there is an assignee)

10. □ English Translation Document (if applicable)

11. □ Information Disclosure Statement/PTO-1449 □ Copies of IDS Citations

12. □ Preliminary Amendment

13. Acknowledgment postcard

14. □ Certificate of Mailing

□ First Class ☒ Express Mail (Specify Label No.): EM529399428 US







Accompanying Application Parts (Continued) 




15. □ Certified Copy of Priority Document(s) (if foreign priority is claimed)

16. □ Additional Enclosures (please identify below):





Fee Calculation and Transmittal 



CLAIMS AS FILED

For                           #Filed    #Allowed    #Extra    Rate        Fee
Total Claims                  31        - 20 =      11        x $18.00    $198.00
Indep. Claims                 3         - 3 =       0         x $78.00    $0.00
Multiple Dependent Claims (check if applicable) □                         $0.00
BASIC FEE                                                                 $760.00
OTHER FEE (specify purpose)                                               $0.00
TOTAL FILING FEE                                                          $958.00



□ A check in the amount of to cover the filing fee is enclosed. 

□ The Commissioner is hereby authorized to charge and credit Deposit Account No. 
as described below. A duplicate copy of this sheet is enclosed. 

□ Charge the amount of as filing fee. 

□ Credit any overpayment. 

□ Charge any additional filing fees required under 37 C.F.R. 1.16 and 1.17. 

□ Charge the issue fee set in 37 C.F.R. 1.18 at the mailing of the Notice of Allowance,
pursuant to 37 C.F.R. 1.311(b).



Dated: July 15, 1999 





Signature 

Steven G. Saunders, Reg. No. 36,265 
BROMBERG & SUNSTEIN LLP 
125 Summer Street 
Boston, MA 02110 
(617) 443-9292 



cc: 






IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



APPLICATION FOR UNITED STATES PATENT 



FOR 



APPARATUS AND METHOD OF 
REGENERATING A 
LOST AUDIO SEGMENT 



Inventor: 

Emre Gunduzhan 

24 Court House Square 
Apartment #907 
Rockville, MD 20850 



Attorney Docket: 2204/188 
(BA-452) 



Attorneys: 

BROMBERG & SUNSTEIN LLP 
125 Summer Street 
Boston, MA 02110
(617) 443-9292



2204-188-92929 
July 15, 1999 






APPARATUS AND METHOD OF REGENERATING 
A LOST AUDIO SEGMENT 

FIELD OF THE INVENTION 

The invention generally relates to data transmission networks and, more particularly, the 
invention relates to regenerating an audio signal segment in an audio signal transmitted across a 
data transmission network. 

BACKGROUND OF THE INVENTION 

Network devices on the Internet commonly transmit audio signals to other network 
devices ("receivers") on the Internet. To that end, prior to transmission, a given audio signal 
commonly is divided into a series of contiguous audio segments that each are encapsulated 
within one or more Internet Protocol packets. Each segment includes a plurality of samples that 
identify the amplitude of the signal at specific times. Once filled with one or more audio 
segments, each Internet Protocol packet is transmitted to one or more Internet receiver(s) in 
accord with the well known Internet Protocol. 

As known in the art, Internet Protocol packets commonly are lost during transmission 
across the Internet. Undesirably, the loss of Internet Protocol packets transporting audio 
segments often significantly degrades signal quality to unacceptable levels. This problem is 
further exacerbated when transmitting a real-time voice signal across the Internet, such as a
real-time voice signal transmitted during a teleconference conducted across the Internet.

SUMMARY OF THE INVENTION 

In accordance with one aspect of the invention, a method and apparatus for generating a 
new audio segment that is based upon a given lost audio segment ("given segment") of an audio
signal first locates a set of consecutive audio segments in the audio signal. The located set of
audio segments precede the given audio segment and have a formant. The formant then is
removed from the set of audio segments to produce a set of residue segments having a pitch. The 
pitch and set of residue segments then are processed to produce a new set of residue segments. 
Once produced, the formant of the consecutive audio segments is added to the new set of residue 
segments to produce the new audio segment. The audio signal includes a plurality of audio 
segments. The above noted formant may include a plurality of variable formants. 

In preferred embodiments, the given audio segment is not ascertainable, while its location 
within the audio signal is ascertainable. The audio signal may be any type of audio signal, such 
as a real-time voice signal transmitted across a packet based network. Among other things, the 
audio signal in such case may be a stream of data packets. The pitch of the set of residue 
segments may be determined to generate the audio segment. In some embodiments, the formant 
is removed by utilizing linear predictive coding filtering techniques. In a similar manner, the 
pitch and set of residue segments may be processed by utilizing such linear predictive coding 
filtering techniques. 

The formant preferably is a variable function that has a variable value across the set of 
audio segments. Overlap-add operations may be applied to the new audio segment to produce an 
overlap new audio segment. In further embodiments, the overlap new audio segment may be 
scaled to produce a scaled overlap new audio segment. The scaled overlap new audio segment 
thus replaces the previously noted new audio segment and thus, is a final new audio segment. 
Once produced, the final new segment is added to the audio signal in place of the given audio 
segment. In preferred embodiments, the set of consecutive audio segments immediately precede 
the given audio segment. Stated another way, in this embodiment, there are no audio segments 
between the set of consecutive audio segments and the given audio segment. 

Preferred embodiments of the invention are implemented as a computer program product 
having a computer usable medium with computer readable program code thereon. The computer 
readable code may be read and utilized by the computer system in accordance with conventional
processes.

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects and advantages of the invention will be appreciated more 
fully from the following further description thereof with reference to the accompanying drawings 
wherein: 

Figure 1 schematically shows a preferred network arrangement in which two telephones 
transmit real-time voice signals across the Internet. 

Figure 2 schematically shows an audio segment generator configured in accord with 
preferred embodiments of the invention. 

Figure 3 shows a process of generating an audio signal in accord with preferred 
embodiments of the invention. 

Figure 4 shows a preferred process of estimating a set of residue segments of an audio
signal.

DESCRIPTION OF PREFERRED EMBODIMENTS 

Figure 1 schematically shows an exemplary data transfer network 10 that may utilize 
preferred embodiments of the invention. In particular, the network 10 includes a first telephone 
12 that communicates with a second telephone 14 via the Internet 16. Each telephone includes a 
segment generator 18 that regenerates lost audio segments from previously received audio 
segments of an audio signal. As previously noted, a segment includes a plurality of audio 
samples. The segment generators 18 may be either internal or external to their respective 
telephones 12 and 14. In preferred embodiments, the segment generators 18 each include a 
computer system for executing conventional computer program code. Such computer system has 
each of the elements commonly utilized for such purpose, including a microprocessor, memory,
controllers, etc. In other embodiments, the segment generators 18 are hardware devices that
execute the functions discussed below with respect to figures 3 and 4.

As noted above, the segment generators 18 utilize previously received audio segments to 
regenerate approximations of lost audio segments of a received audio signal. For example, the 
first telephone 12 may receive a plurality of Internet Protocol packets ("IP packets") transporting
a given real-time voice signal from the second telephone 14. Upon analysis of the received IP
packets, the first telephone 12 may detect that it had not received all of the necessary IP packets
to reproduce the entire given signal. Such IP packets that were not received may have been lost
during transmission, thus losing one or more audio segments of the given audio (voice) signal.
As detailed below, the segment generator 18 of the first telephone 12 regenerates the missing one
or more audio segments from the received audio segments to produce a set of regenerated audio
segments. The set of regenerated audio segments, however, is an approximation of the lost audio
segments and thus, is not necessarily an exact copy of such segments. Once generated, each
segment in the set of regenerated audio segments is added to the given audio signal in its
appropriate location, thus reconstructing the entire signal. If subsequent audio segments are
similarly lost, the regenerated segment can be utilized to regenerate such subsequent audio
segments.
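
By way of illustration only, the receive-side behavior described above may be sketched in Python as follows. The dictionary keyed by a sequence index, the `regenerate` callback, and the `history_size` parameter are assumptions made for this sketch; the description itself leaves loss detection to conventional processes.

```python
def rebuild_stream(received, total_segments, regenerate, history_size=3):
    """Rebuild a full segment sequence, regenerating any segment that is missing.

    received: dict mapping a sequence index to a segment (list of samples).
    regenerate: hypothetical callback that builds a replacement segment from
        the most recent segments (received or previously regenerated).
    """
    history = []
    output = []
    for index in range(total_segments):
        segment = received.get(index)
        if segment is None:
            # Lost segment: approximate it from the segments that precede it.
            segment = regenerate(history[-history_size:])
        output.append(segment)
        # A regenerated segment joins the history, so it can be used to
        # regenerate a later segment that is similarly lost.
        history.append(segment)
    return output
```

In this sketch a regenerated segment is appended to the history exactly like a received one, which mirrors the statement that a regenerated segment can be utilized to regenerate subsequent lost segments.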

It should be noted that two telephones are shown in figure 1 as a simplified example of a 
network 10 that can be utilized to implement preferred embodiments. Accordingly, principles of 
preferred embodiments of the invention can be applied to other network arrangements
transporting packetized data between various network nodes. For example, the network 10 may
be any public or private network utilizing known transport protocols, such as the aforementioned
Internet Protocol, Asynchronous Transfer Mode, Frame Relay, and other such protocols. In
addition to or instead of two telephones, the network 10 may include computer systems, audio
gateways, or additional telephones. Moreover, the audio transmissions may be any type of audio
transmission, such as a unicast, broadcast, or multicast of any known type of audio signal.

Figure 2 schematically shows a segment generator 18 configured in accordance with
preferred embodiments of the invention to execute the process shown in figure 3. Specifically,
the segment generator 18 includes an input 20 that receives previous segments of the audio 
signal, and a linear predictive coding analyzer ("LP analyzer 22") that determines the 
characteristics of the formant of the received segments. The LP analyzer 22 preferably utilizes 
autocorrelation analysis techniques commonly employed in the voice signal processing field. 
The LP analyzer 22 consequently forwards the determined formant characteristics to a linear 
predictive filter ("LPC filter 24") that utilizes such characteristics to remove the formant from the 
input segments. In a similar manner, the LP analyzer 22 also forwards the determined formant 
characteristics to an inverse linear predictive filter ("inverse LPC filter 26") that restores the 
formant characteristics to a residue signal (a/k/a "residue segment(s)"). Both the LPC filter 24 
and inverse LPC filter 26 utilize conventionally known methods for performing their respective 
functions. 
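
As a non-limiting illustration of the autocorrelation analysis mentioned above, the following Python sketch computes linear prediction coefficients with the Levinson-Durbin recursion. The recursion is one conventional way to perform such analysis and is an assumption here; the function names and the choice of predictor order are likewise hypothetical and are not taken from this description.

```python
def autocorrelation(samples, max_lag):
    """Return r[0..max_lag], where r[k] = sum over n of samples[n] * samples[n + k]."""
    n = len(samples)
    return [sum(samples[i] * samples[i + k] for i in range(n - k))
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the autocorrelation normal equations for predictor coefficients.

    Returns a list a (0-based) such that the predicted sample is
    x_hat[n] = a[0] * x[n - 1] + a[1] * x[n - 2] + ...
    """
    a = [0.0] * order
    error = r[0]
    for i in range(order):
        if error <= 0.0:
            break  # silent or degenerate input: leave remaining coefficients at zero
        acc = r[i + 1] - sum(a[j] * r[i - j] for j in range(i))
        k = acc / error  # reflection coefficient for this stage
        updated = a[:]
        updated[i] = k
        for j in range(i):
            updated[j] = a[j] - k * a[i - 1 - j]
        a = updated
        error *= (1.0 - k * k)
    return a
```

For a decaying exponential such as `[0.9 ** n for n in range(200)]`, a second-order analysis yields a first coefficient close to 0.9 and a second coefficient close to zero, as expected for a single-pole signal.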

In addition to the elements noted above, the segment generator 18 also includes a pitch 
detector 28 that determines the pitch of one or more residue segments, and an estimator 30 that 
utilizes the determined pitch and residue segments to estimate the residue segments of the lost 
audio segments being regenerated. An overlap-add/scaling module 32 also is included
to perform conventional overlap-add operations and conventional scaling operations. In
preferred embodiments, the pitch detector 28, estimator 30, and overlap-add/scaling module 32 
each utilize conventional processes known in the art. 

Figure 3 shows a preferred process utilized by the segment generator 18 for regenerating 
the lost audio segment(s) of a real-time voice signal. This process makes use of the symmetric 
nature of a person's vocal tract over a relatively short time interval. More particularly, according 
to many well known conventions, a final voice signal is modeled as being a waveform traversing 
through a tube. The tube, of course, is a person's vocal tract, which includes the throat and 
mouth. When passing through the vocal tract, the waveform is modified by the resonances of the 
tract, thus producing the final voice signal. The effect of the vocal tract on the waveform thus is 
represented by the resonances that it produces. These resonances are known in the art as
"formants." Accordingly, removing the formant from a final voice signal produces the original
waveform, which is known in the art as a "residue" or a "residue signal." The residue signal may
be referred to herein as a set of residue segments.

As known in the art, the audio signal is broken into a sequence of consecutive audio
segments for transmission across an IP network. The process shown in figure 3 therefore is
initiated when it is detected, by conventional processes, that one of the audio segments is missing
from the received sequence of consecutive audio segments. The process therefore begins at step
300 in which a set of consecutive audio segments that precedes the lost segment is retrieved.
The set of retrieved audio segments preferably ranges from one audio segment to fifteen audio
segments. In alternative embodiments, retrieving each of the audio samples in the 60 to 70
milliseconds of the audio signal immediately preceding the lost audio segment should produce
satisfactory results. The segment generator 18 may be preconfigured to utilize any set number of
audio segments.

The set of audio segments preferably includes one or more audio segments that
immediately precede the lost segment. A preceding audio segment in the audio signal is
considered to immediately precede a subsequent audio segment when there are no intervening
audio segments between the preceding and subsequent audio segments. The set of audio
segments may be retrieved from a buffer (not shown) that stores the audio segments prior to
processing.
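
Purely as an illustrative sketch, the buffer noted above (not shown in the figures) might be modeled in Python as a bounded queue of the most recent segments. The class name, its methods, and the default capacity of fifteen segments are assumptions chosen to match the preferred range stated above.

```python
from collections import deque


class SegmentBuffer:
    """Holds the most recent audio segments so step 300 can retrieve them."""

    def __init__(self, max_segments=15):
        # Older segments fall off automatically once the capacity is reached.
        self._segments = deque(maxlen=max_segments)

    def push(self, segment):
        """Store a received (or regenerated) segment as the newest entry."""
        self._segments.append(list(segment))

    def preceding_samples(self, count=None):
        """Concatenate the newest `count` segments (all if None), oldest first."""
        segments = list(self._segments)
        if count is not None:
            segments = segments[-count:] if count > 0 else []
        return [sample for segment in segments for sample in segment]
```

Because the newest entries are always the ones retained, the samples returned immediately precede whichever segment is found to be missing next.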

Once the set of audio segments is retrieved, the process continues to step 302 in which 
the LP analyzer 22 calculates the tract data (i.e., formant data) from the set of segments. As
noted above, the LP analyzer 22 utilizes conventional autocorrelation analysis techniques to
calculate this data, and forwards such data to the LPC filter 24 and inverse LPC filter 26. The
process then continues to step 304 in which the formants are removed from the input set of audio
segments. To that end, the set of audio segments are filtered by the LPC filter 24 to produce a set
of residue segments. The set of residue segments then are forwarded to both the estimator 30 and
pitch detector 28.
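
Step 304 may be pictured, by way of example only, as a conventional prediction-error filter. The Python sketch below assumes the formant data takes the form of a list of predictor coefficients `coeffs`; that representation and the function name are assumptions for illustration.

```python
def lpc_residue(samples, coeffs):
    """Prediction-error (analysis) filter.

    Computes e[n] = x[n] - (coeffs[0] * x[n-1] + coeffs[1] * x[n-2] + ...),
    treating samples before the start of the input as zero.
    """
    residue = []
    for n, x in enumerate(samples):
        predicted = sum(a * samples[n - k]
                        for k, a in enumerate(coeffs, start=1) if n - k >= 0)
        residue.append(x - predicted)
    return residue
```

When the coefficients match the resonance of the input, most of the signal energy is removed and the residue retains mainly the excitation, which is what the pitch detector 28 and estimator 30 operate on.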

Accordingly, the process continues to step 306 in which the pitch period of the set of
residue segments is determined by the pitch detector 28 and forwarded to the estimator 30. In
some embodiments, if the pitch detector 28 cannot adequately determine the pitch period of the
set of residue segments, then it forwards the size of the lost audio segment to the estimator 30.
The estimator 30 utilizes this alternative information as pitch period information. Once received by
the estimator 30, both the determined pitch period and the set of residue segments are processed 
to produce a new set of residue segments (a/k/a "residue signal") that approximate both a set of 
residue segments of the lost audio segments, and the residues of the two overlap segments that 
immediately precede and follow the lost audio segment (step 308). 
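
The pitch determination of step 306, including the fallback to the size of the lost segment, may be sketched as follows. Normalized autocorrelation peak picking is only one conventional technique and is assumed here; the lag bounds and the `threshold` value of 0.3 are hypothetical tuning parameters not found in this description.

```python
def detect_pitch_period(residue, min_lag, max_lag, lost_size, threshold=0.3):
    """Return the lag (in samples) with the strongest normalized autocorrelation.

    If no lag scores above `threshold`, the pitch is treated as undetermined
    and `lost_size` is returned in its place.
    """
    energy = sum(s * s for s in residue)
    best_lag, best_score = None, threshold
    if energy > 0.0:
        last_lag = min(max_lag, len(residue) - 1)
        for lag in range(min_lag, last_lag + 1):
            corr = sum(residue[i] * residue[i + lag]
                       for i in range(len(residue) - lag))
            score = corr / energy
            if score > best_score:
                best_lag, best_score = lag, score
    return best_lag if best_lag is not None else lost_size
```

For an impulse train that repeats every twenty samples the function returns twenty, while for an all-zero residue it returns the supplied lost segment size.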

The estimator 30 may utilize one of many well known methods to approximate the new 
set of residue segments. One method utilized by the estimator 30 is shown in figure 4. Such 
method begins at step 400 in which a set of consecutive samples having a size equal to the pitch 
period is retrieved from the end of the set of residue segments. For example, if the pitch period 
is twenty, then the estimator 30 retrieves the last twenty samples. Then, at step 402, the set of 
samples immediately preceding the set retrieved in step 400 is copied into the new residue signal. 
The size of the set copied at step 402 is equal to the size of the overlap segment that immediately 
precedes the lost audio segment. In the above example, if the size of the overlap segment is 
thirty, then thirty samples that immediately precede the last twenty samples are copied into the 
new residue signal. The process then continues to step 404 in which the set retrieved in step 400 
is added as many times as necessary to the new residue signal to make the size of the new residue 
signal equal to the size of the lost audio segment, plus the sum of the sizes of the two overlap 
segments. Continuing with the above example, if the size of the lost audio segment is seventy 
and the size of the second overlap segment is thirty, then five replicas of the set retrieved in step 
400 are added to the already existing thirty samples. 
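
Steps 400 through 404 may be sketched in Python as shown below, strictly as an illustration. The function and parameter names are hypothetical, and trimming the result when the pitch period does not divide the required length evenly is an assumption of this sketch.

```python
def estimate_residue(residue, pitch_period, first_overlap, lost_size, second_overlap):
    """Build the new residue signal from the tail of the received residue."""
    if pitch_period <= 0 or len(residue) < pitch_period + first_overlap:
        raise ValueError("not enough residue samples for the requested sizes")

    # Step 400: retrieve the last pitch period of samples.
    pitch_cycle = residue[-pitch_period:]

    # Step 402: copy the samples immediately preceding that set; their count
    # equals the size of the overlap segment that precedes the lost segment.
    end = len(residue) - pitch_period
    new_residue = list(residue[end - first_overlap:end])

    # Step 404: append replicas of the retrieved set until the new residue
    # covers the lost segment plus both overlap segments.
    target = first_overlap + lost_size + second_overlap
    while len(new_residue) < target:
        new_residue.extend(pitch_cycle)
    return new_residue[:target]  # assumption: trim any excess samples
```

With the figures used in the example above (a pitch period of twenty, overlap segments of thirty, and a lost segment of seventy), the call returns 130 samples: the thirty copied samples followed by five replicas of the final twenty.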

Returning to figure 3, once the estimator 30 generates the residue of the lost segments at 
step 308, the process continues to step 310 in which the vocal tract data is added back into the 
newly generated set of residue segments. To that end, the newly generated set of residue 
segments is passed through the inverse LPC filter 26, thus adding the formants of the initially
calculated vocal tract. This produces a reproduced set of audio segments that approximate the
lost set of audio segments. 
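
For illustration, step 310 may be sketched as a conventional all-pole synthesis filter. The coefficient list `coeffs` and the optional `history` argument, which seeds the filter with already-known output samples, are assumptions of this sketch and are not specified in this description.

```python
def lpc_synthesize(residue, coeffs, history=()):
    """Inverse (synthesis) filter.

    Computes y[n] = e[n] + (coeffs[0] * y[n-1] + coeffs[1] * y[n-2] + ...).
    `history` lists prior output samples, oldest first, so the filter state
    can continue from audio that was actually received.
    """
    out = list(history)
    start = len(out)
    for e in residue:
        n = len(out)
        predicted = sum(a * out[n - k]
                        for k, a in enumerate(coeffs, start=1) if n - k >= 0)
        out.append(e + predicted)
    return out[start:]  # only the newly synthesized samples
```

Feeding a single unit impulse through a one-coefficient filter of 0.5 yields 1.0, 0.5, 0.25, 0.125, showing how the resonance is added back to the residue.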

The reproduced set of audio segments then may be further processed by the overlap- 
add/scaling module 32 by applying conventional overlap-add and scaling operations to the 
reproduced set. To that end, the middle portion of the reproduced audio signal/segments, which 
approximates the lost audio segment, is scaled and then used to replace the lost audio segment. 
The set of samples before the middle portion is overlapped with and added to the set of samples 
at the end of the set of audio segments retrieved at step 300, thus replacing those samples. The 
set of samples after the middle portion is discarded if the following audio segment also is lost. 
Otherwise, it is overlapped with and added to the set of samples at the beginning of the following 
audio segment, thus replacing those samples. In preferred embodiments, a conventionally known 
Hamming window is used in both overlap/add operations. Once the reproduced set of audio 
segments is generated, it immediately may be added to the audio signal, thus providing an 
approximation of the entire audio signal. 
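
One possible form of the windowed overlap-add operation is sketched below in Python. It is offered only as an assumption-laden example: the use of the two halves of a periodic Hamming window as fade weights, and the division by 1.08 so that the two weights sum to one, are choices made for this sketch rather than details given in this description. Scaling is omitted.

```python
import math


def hamming_fade_in(length):
    """Rising half of a periodic Hamming window of size 2 * length."""
    return [0.54 - 0.46 * math.cos(math.pi * i / length) for i in range(length)]


def crossfade(outgoing, incoming):
    """Overlap and add two equal-length sample sets using Hamming weights.

    `outgoing` fades out while `incoming` fades in. The two window halves
    sum to 1.08 at every position, so the result is divided by 1.08.
    """
    if len(outgoing) != len(incoming):
        raise ValueError("overlap regions must have equal length")
    mixed = []
    for old, new, w_in in zip(outgoing, incoming, hamming_fade_in(len(outgoing))):
        w_out = 1.08 - w_in  # falling half of the same window
        mixed.append((w_out * old + w_in * new) / 1.08)
    return mixed
```

Under these assumptions, the first overlap would pass the end of the retrieved audio as `outgoing` and the start of the reproduced set as `incoming`; the second overlap would pass the end of the reproduced set as `outgoing` and the start of the following segment as `incoming`.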

During testing of the discussed process, satisfactory results have been produced with 
signals having losses of up to about ten percent. It is anticipated, however, that this process can 
produce satisfactory results with audio signals having losses that are greater than ten percent. It 
should be noted that although real-time voice signals are discussed herein, preferred 
embodiments are not intended to be limited to such signals. Accordingly, preferred embodiments 
may be utilized with non-real time audio signals. 

As suggested above, preferred embodiments of the invention may be implemented in any 
conventional computer programming language. For example, preferred embodiments may be 
implemented in a procedural programming language (e.g., "C") or an object oriented 
programming language (e.g., "C++"). Alternative embodiments of the invention may be 
implemented as preprogrammed hardware elements (e.g., application specific integrated circuits 
or digital signal processors), or other related components. 

Alternative embodiments of the invention may be implemented as a computer program
product for use with a computer system. Such implementation may include a series of computer
instructions fixed either on a tangible medium, such as a computer readable medium (e.g., a
diskette, CD-ROM, ROM, or fixed disk), or transmittable to a computer system via a modem or
other interface device, such as a communications adapter connected to a network over a medium. 
The medium may be either a tangible medium (e.g., optical or analog communications lines) or a 
medium implemented with wireless techniques (e.g., microwave, infrared or other transmission 
techniques). The series of computer instructions preferably embodies all or part of the 
functionality previously described herein with respect to the system. Those skilled in the art 
should appreciate that such computer instructions can be written in a number of programming 
languages for use with many computer architectures or operating systems. Furthermore, such 
instructions may be stored in any memory device, such as semiconductor, magnetic, optical or 
other memory devices, and may be transmitted using any communications technology, such as 
optical, infrared, microwave, or other transmission technologies. It is expected that such a 
computer program product may be distributed as a removable medium with accompanying 
printed or electronic documentation (e.g., shrink wrapped software), preloaded with a computer 
system (e.g., on system ROM or fixed disk), or distributed from a server or electronic bulletin 
board over the network (e.g., the Internet or World Wide Web). 

Although various exemplary embodiments of the invention have been disclosed, it should 
be apparent to those skilled in the art that various changes and modifications can be made which 
will achieve some of the advantages of the invention without departing from the true scope of the 
invention. These and other obvious modifications are intended to be covered by the appended 
claims.

I claim:

1. A method of generating a new audio segment based upon a given audio segment of an
audio signal, the audio signal having a plurality of audio segments, the method comprising: 

locating a set of consecutive audio segments in the audio signal, the set of consecutive 
audio segments preceding the given audio segment and having a formant; 

removing the formant from the set of audio segments to produce a set of residue segments 
having a pitch; 

processing the pitch and the set of residue segments to produce a new set of residue 
segments; and 

adding the formant of the consecutive set of audio segments to the new set of residue 
segments to produce an output audio segment. 

2. The method as defined by claim 1 wherein the given audio segment is
not ascertainable, the location of the given audio segment within the audio signal being
ascertainable.

3. The method as defined by claim 1 wherein the audio signal is a voice signal transmitted 
across a packet based network, the audio signal being a stream of data packets. 

4. The method as defined by claim 1 further comprising: 
determining the pitch of the set of residue segments. 

5. The method as defined by claim 1 wherein the formant is removed by utilizing linear 
predictive coding filtering techniques. 



6. The method as defined by claim 1 wherein the pitch and set of residue segments are
processed by utilizing linear predictive coding filtering techniques.

7. The method as defined by claim 1 wherein the formant is a function having a variable 
value across the set of audio segments. 

8. The method as defined by claim 1 further comprising: 

applying overlap-add operations to the output audio segment to produce an overlap audio 
segment. 

9. The method as defined by claim 8 further comprising: 

scaling the overlap audio segment to produce a scaled audio segment, the scaled audio 
segment being the new audio segment. 

10. The method as defined by claim 1 further comprising: 

adding the output audio segment to the audio signal in place of the given audio segment. 



11. A computer program product for use on a computer system for generating a new audio
segment based upon a given audio segment of an audio signal, the audio signal having a plurality 
of audio segments, the computer program product comprising a computer usable medium having 
computer readable program code thereon, the computer readable program code including: 

program code for locating a set of consecutive audio segments in the audio signal, the set 
of consecutive audio segments preceding the given audio segment and having a formant; 

program code for removing the formant from the set of audio segments to produce a set 
of residue segments having a pitch; 

program code for processing the pitch and the set of residue segments to produce a new 
set of residue segments; and 

program code for adding the formant of the consecutive set of audio segments to the new
set of residue segments to produce an output audio segment.

12. The computer program product as defined by claim 11 wherein the given audio segment
is not ascertainable, the location of the given audio segment within the audio signal being
ascertainable.

13. The computer program product as defined by claim 11 wherein the audio signal is a voice
signal transmitted across a packet based network, the audio signal being a stream of data packets.

14. The computer program product as defined by claim 11 further comprising:
program code for determining the pitch of the set of residue segments.

15. The computer program product as defined by claim 11 wherein the program code for
removing the formant comprises program code for utilizing linear predictive coding filtering
techniques.

16. The computer program product as defined by claim 11 wherein the program code for
processing includes program code for utilizing linear predictive coding filtering techniques.

17. The computer program product as defined by claim 11 wherein the formant is a function
having a variable value across the set of audio segments.

18. The computer program product as defined by claim 11 further comprising:

program code for applying overlap-add operations to the output audio segment to produce
an overlap audio segment.

19. The computer program product as defined by claim 18 further comprising:
program code for scaling the overlap audio segment to produce a scaled audio segment,
the scaled audio segment being the new audio segment.

20. The computer program product as defined by claim 11 further comprising:

program code for adding the output audio segment to the audio signal in place of the 
given audio segment. 



21. An apparatus for generating a new audio segment based upon a given audio segment of
an audio signal, the audio signal having a plurality of audio segments, the apparatus comprising: 

an input to receive a set of consecutive audio segments, the set of consecutive audio 
segments preceding the given audio segment; 

a filter operatively coupled with the input, the filter removing the formant from the set of 
consecutive audio segments to produce a set of residue segments having a pitch; 

a pitch detector operatively coupled with the filter, the pitch detector calculating the pitch 
of the set of residue segments; 

an estimator operatively coupled with the pitch detector, the estimator producing a new 
set of residue segments based upon the set of residue segments and the calculated pitch; and 
an inverse filter operatively coupled with the estimator, the inverse filter adding the 
formant of the consecutive set of audio segments to the new set of residue segments to produce 
an output audio segment. 

22. The apparatus as defined by claim 21 further comprising: 

an analyzer operatively coupled with the input, the analyzer calculating formant values 
for generating the filter. 




23. The apparatus as defined by claim 21 wherein the given audio segment is
not ascertainable, the location of the given audio segment within the audio signal being
ascertainable.

24. The apparatus as defined by claim 21 wherein the audio signal is a voice signal 
transmitted across a packet based network, the audio signal being a stream of data packets. 

25. The apparatus as defined by claim 21 wherein the filter utilizes linear predictive coding 
filtering techniques. 

26. The apparatus as defined by claim 21 wherein the inverse filter utilizes linear predictive
coding filtering techniques. 

27. The apparatus as defined by claim 21 wherein the formant is a function having a variable 
value across the set of audio segments. 

28. The apparatus as defined by claim 21 further comprising: 

an overlap add module that applies overlap-add operations to the output audio segment to 
produce an overlap audio segment. 

29. The apparatus as defined by claim 28 further comprising: 

a scaler operatively coupled with the overlap add module, the scaler scaling the overlap 
audio segment to produce a scaled audio segment, the scaled audio segment being the new audio 
segment. 

30. The apparatus as defined by claim 21 further comprising: 

an adder that adds the output audio segment to the audio signal in place of the given
audio segment.

31. The apparatus as defined by claim 21 wherein the set of consecutive audio segments
immediately precede the given audio segment. 
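
For illustration only, the overlap-add, scaling, and replacement elements recited in claims 28 to 30 can be sketched as three small Python functions. The claims do not specify a window shape, an overlap length, or a gain, so the linear cross-fade into the audio following the lost segment, the constant gain, and all function and parameter names below are assumptions of this sketch rather than details taken from the application.

```python
def overlap_add(output_segment, following, overlap):
    """Cross-fade the last `overlap` samples of the regenerated segment
    into the first `overlap` samples of the audio that follows it.
    The linear ramp is an assumed window, chosen for simplicity."""
    if overlap > len(output_segment) or overlap > len(following):
        raise ValueError("overlap is longer than the available samples")
    blended = list(output_segment)
    base = len(blended) - overlap
    for i in range(overlap):
        w = (i + 1) / (overlap + 1)  # weight of the following audio rises toward 1
        blended[base + i] = (1.0 - w) * blended[base + i] + w * following[i]
    return blended


def scale(segment, gain):
    """Scale every sample by a constant gain (an assumed, simple scaler)."""
    return [gain * s for s in segment]


def replace_in_signal(signal, start, segment):
    """Return a copy of `signal` with `segment` written in at index `start`."""
    if start < 0 or start + len(segment) > len(signal):
        raise ValueError("segment does not fit inside the signal")
    out = list(signal)
    out[start:start + len(segment)] = segment
    return out
```

The functions are applied in the order in which the claims recite them: `overlap_add`, then `scale`, then `replace_in_signal`. With a gain other than 1, a practical system might instead scale before blending or ramp the gain to avoid a step at the segment boundary; that choice is outside what the claims state.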



ABSTRACT OF THE DISCLOSURE

A method and apparatus for generating a new audio segment that is based upon a given 
audio segment of an audio signal first locates a set of consecutive audio segments in the audio 
signal. The located set of audio segments precedes the given audio segment and has a formant.
The formant then is removed from the set of audio segments to produce a set of residue segments
having a pitch. The pitch and set of residue segments then are processed to produce a new set of 
residue segments. Once produced, the formant of the consecutive audio segments is added to the 
new set of residue segments to produce the new audio segment. The audio signal includes a 
plurality of audio segments. 
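
As a non-authoritative illustration of the sequence summarized above (remove the formant, calculate the pitch of the residue, produce a new residue, add the formant back), the following Python sketch uses linear predictive coding computed with the Levinson-Durbin recursion, a normalized autocorrelation pitch search, and periodic repetition of the last pitch period of the residue. Those three techniques, the default parameter values, and every function name are assumptions made for this sketch; the abstract does not prescribe them.

```python
import math


def lpc_coefficients(x, order):
    """Levinson-Durbin recursion on the autocorrelation of x.
    Returns [a1, ..., a_order] so that x[n] is predicted by sum(a_k * x[n-k])."""
    r = [sum(x[n] * x[n - k] for n in range(k, len(x))) for k in range(order + 1)]
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or degenerate input: keep remaining taps at zero
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:]


def residue(x, a):
    """Remove the formant: e[n] = x[n] - sum(a_k * x[n-k])."""
    return [x[n] - sum(a[k] * x[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
            for n in range(len(x))]


def detect_pitch(e, min_lag, max_lag):
    """Lag in [min_lag, max_lag] with the highest normalized autocorrelation."""
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        num = sum(e[n] * e[n - lag] for n in range(lag, len(e)))
        den = math.sqrt(sum(e[n] ** 2 for n in range(lag, len(e))) *
                        sum(e[n - lag] ** 2 for n in range(lag, len(e))))
        score = num / den if den > 0.0 else 0.0
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def regenerate(history, length, order=10, min_lag=20, max_lag=160):
    """Produce `length` samples to stand in for the segment after `history`.
    Returns (samples, pitch_lag)."""
    if len(history) <= max_lag or len(history) < order:
        raise ValueError("history is too short for the requested settings")
    a = lpc_coefficients(history, order)
    e = residue(history, a)
    pitch = detect_pitch(e, min_lag, max_lag)
    # New residue: repeat the last pitch period of the old residue.
    new_e = [e[len(e) - pitch + (n % pitch)] for n in range(length)]
    # Inverse filter: add the formant back, seeded with the end of the history.
    y = list(history[-order:])
    for n in range(length):
        y.append(new_e[n] + sum(a[k] * y[-1 - k] for k in range(order)))
    return y[order:], pitch
```

Here `history` stands for the samples of the consecutive segments that precede the lost one, and `length` is the number of samples to regenerate. The lag bounds would normally be chosen from the sampling rate and the expected pitch range of the speaker.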
Declaration and Power of Attorney For Patent Application

English Language Declaration 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, 

first and joint inventor (if plural names are listed below) of the subject matter which is claimed and for 

which a patent is sought on the invention entitled 

Apparatus and Method of Regenerating a Lost Audio Segment

the specification of which

(check one)

☒ is attached hereto.

□ was filed on ________ as United States Application No. or PCT International

Application Number ________

and was amended on ________

(if applicable)

I hereby state that I have reviewed and understand the contents of the above identified specification,
including the claims, as amended by any amendment referred to above.

I acknowledge the duty to disclose to the United States Patent and Trademark Office all information
known to me to be material to patentability as defined in Title 37, Code of Federal Regulations, 
Section 1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, Section 119(a)-(d) or 
Section 365(b) of any foreign application(s) for patent or inventor's certificate, or Section 365(a) of 
any PCT International application which designated at least one country other than the United States, 
listed below and have also identified below, by checking the box, any foreign application for patent or 
inventor's certificate or PCT International application having a filing date before that of the application 
on which priority is claimed. 

Prior Foreign Application(s) Priority Not Claimed 



□ 

(Number) (Country) (Day/Month/Year Filed) 

□ 

(Number) (Country) (Day/Month/Year Filed) 

□

(Number) (Country) (Day/Month/Year Filed) 
I hereby claim the benefit under 35 U.S.C. Section 119(e) of any United States provisional
application(s) listed below:


(Application Serial No.)


(Filing Date)


(Application Serial No.) 


(Filing Date) 


(Application Serial No.) 


(Filing Date) 



I hereby claim the benefit under 35 U.S.C. Section 120 of any United States application(s), or
Section 365(c) of any PCT International application designating the United States, listed below and,
insofar as the subject matter of each of the claims of this application is not disclosed in the prior
United States or PCT International application in the manner provided by the first paragraph of 35
U.S.C. Section 112, I acknowledge the duty to disclose to the United States Patent and Trademark
Office all information known to me to be material to patentability as defined in Title 37, C.F.R.,
Section 1.56 which became available between the filing date of the prior application and the national
or PCT International filing date of this application:



(Application Serial No.) 


(Filing Date) 


(Status) 




(patented, pending, abandoned) 


(Application Serial No.) 


(Filing Date) 


(Status) 




(patented, pending, abandoned) 


(Application Serial No.) 


(Filing Date) 


(Status) 




(patented, pending, abandoned) 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 
POWER OF ATTORNEY: As a named inventor, I hereby appoint the following attorney(s) and/or

agent(s) to prosecute this application and transact all business in the Patent and Trademark Office 

connected therewith. (list name and registration number) 

Bruce D. Sunstein Reg. No. 27,234 Elizabeth P. Morano Reg. No. 42,904

Robert M. Asher Reg. No. 30,445

Timothy M. Murphy Reg. No. 33,198

Steven G. Saunders Reg. No. 36,265

Harriet M. Strimpel Reg. No. 37,008

Samuel J. Petuchowski Reg. No. 37,910

Jeffrey T. Klayman Reg. No. 39,250

John J. Stickevers Reg. No. 39,387

Herbert A. Newborn Reg. No. 42,031

Jean M. Tibbetts Reg. No. 43,193

Jay Sandvos Reg. No. 43,900



Send Correspondence to: 



Steven G. Saunders 
Bromberg & Sunstein LLP
125 Summer Street
Boston, MA 02110



Direct Telephone Calls to: (name and telephone number) 
Steven G. Saunders (617) 443-9292





Full name of sole or first inventor 
Emre Gunduzhan 




Sole or first inventor's signature 


Date 




Residence 

24 Court House Square, Apt. 907, Rockville, MD 20850 




Citizenship 
Turkey 




Post Office Address 
Same as residence 







Full name of second inventor, if any 



Second inventor's signature Date 